Collective Content Selection for Concept-to-Text Generation

نویسندگان

  • Regina Barzilay
  • Mirella Lapata
چکیده

A content selection component determines which information should be conveyed in the output of a natural language generation system. We present an efficient method for automatically learning content selection rules from a corpus and its related database. Our modeling framework treats content selection as a collective classification problem, thus allowing us to capture contextual dependencies between input items. Experiments in a sports domain demonstrate that this approach achieves a substantial improvement over context-agnostic methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating Content Selection for Language Generation using Machine Learning

The content selection component of a natural language generation system decides which information should be communicated in its output. We use information from reports on the game of cricket. We first describe a simple factoid-to-text alignment algorithm then treat content selection as a collective classification problem and demonstrate that simple ‘grouping’ of statistics at various levels of ...

متن کامل

Using Integer Linear Programming in Concept-to-Text Generation to Produce More Compact Texts

We present an ILP model of concept-totext generation. Unlike pipeline architectures, our model jointly considers the choices in content selection, lexicalization, and aggregation to avoid greedy decisions and produce more compact texts.

متن کامل

Probabilistic Approaches for Modeling Text Structure and their Application to Text-to-Text Generation (Invited Talk)

Text-to-text generation aims to produce a coherent text by extracting, combining and rewriting information given in input texts. Examples of its applications include summarization, answer fusion in question-answering and text simplification. At first glance, text-to-text generation seems a much easier task than the traditional generation set-up where the input consists of a non-linguistic repre...

متن کامل

Inducing Document Plans for Concept-to-Text Generation

In a language generation system, a content planner selects which elements must be included in the output text and the ordering between them. Recent empirical approaches perform content selection without any ordering and have thus no means to ensure that the output is coherent. In this paper we focus on the problem of generating text from a database and present a trainable end-to-end generation ...

متن کامل

مسأله حضور در فضا: آگاهی و عاملیت فضایی با تاکید بر فضای عمومی شهری

Public space is the realm of the concrete and substantial presence of the different social groups with different behavior patterns. The concept of space in this sense is an entity that, by the people and through individual and collective action and social relations are formed. The presence of people in the space-in a way that is free from domination, could Strength the urban life. this paper, b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005